Category A

1.  Clostridium  botulinum 

2.  Bacillus  anthracis  (anthrax) 

3.  Francisella  tularensis  (tularemia) 

4.  Yersinia  pestis 


Category  C 

1. Mycobacterium tuberculosis (multidrug-resistant)

2.  Rickettsias  (other) 


Category  B 

1.  Brucella  species  (brucellosis) 

2. Burkholderia pseudomallei

3.  Burkholderia  mallei  (glanders) 

4.  Campylobacter  jejuni 

5. Clostridium perfringens (epsilon toxin)

6. Coxiella burnetii (Q fever)

7.  Escherichia  coli  (diarrheagenic) 

8.  Listeria  monocytogenes 

9.  Rickettsia  prowazekii  (typhus  fever) 

10.  Salmonella 

11.  Shigella  species 

12. Staphylococcus aureus (enterotoxin B)

13.  Vibrios  (pathogenic) 

14.  Yersinia  enterocolitica 


Genomes  of  all  above  organisms  have  been  sequenced 


5/12/2004  versus  (11/12/2004) 


Number of Fully Sequenced Genomes of Eubacteria: 145 (178) (From 1995)
Number of Fully Sequenced Genomes During Last 12 Months: 55 {68}


Prokaryotic Ongoing Genome Projects: 489 {528}

Archaeal: 28 {27}  Bacterial: 461 {499}


Example  of  Ongoing  Genome  Projects 


Bacillus  anthracis  A1055  (Group  C) 

Bacillus  anthracis  Ames  Ancestor 

Bacillus  anthracis  Ames  Florida 

Bacillus anthracis Australia 94 (GT55, Group A3a)

Bacillus  anthracis  CNEVA-9066  (GT79  Group  B2) 

Bacillus  anthracis  Kruger  B  (GT87  Group  B1) 

Bacillus  anthracis  Vollum  (GT77  Group  A4) 

Bacillus anthracis Western N. America (GT3 Group A1a)
Bacillus  anthracis  STN 

Bacillus  anthracis  ZK 

First,  some  terminology... 


Taxonomy -

the science of naming and classifying organisms;

Classification  - 

placement  of  an  organism  within  a  scheme  relating 
different  groups  of  organisms; 

Identification  - 

the  determination  of  whether  an  organism  should  be 
placed  within  a  group  of  organisms  known  to  fit  within 
some  classification  scheme; 

(the  practical  use  of  classification  criteria) 

Phylogenetics  - 

focuses  on  evolutionary  relationships  between 
organisms  or  genes/proteins 

Phylogenetic  Approach 

The  ideal  means  of  identifying  and  classifying  bacteria  would  be  to  compare 
each  gene  sequence  in  a  given  strain  with  the  gene  sequences  for  every 
known  species. 


Taxonomy  of  Bacteria 

(Linnaean  System) 


Universal  Phylogenetic  Tree  of  Bacteria 
Based  on  SSU  rRNA  Sequences 


Approach:  (1)  Bacterial  Sample  Processing 


DNA:      CCTGAGCCA  ATTGATGAA

   |  transcription
   v

mRNA:     CCUGAGCCA  AUUGAUGAA

   |  translation
   v

Peptide:  PEP  IDE


The  relation  between  the 
sequence  of  bases  in  DNA  and 
the  sequence  of  amino  acids  in 
a  protein 
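
The DNA-to-peptide relation shown above can be reproduced with a short sketch. It assumes the standard genetic code and, as on the slide, treats the DNA shown as the coding strand, so that transcription amounts to replacing T with U. The function names are illustrative only.

```python
from itertools import product

# Standard genetic code, written in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def transcribe(dna):
    """Coding-strand DNA to mRNA (T becomes U)."""
    return dna.upper().replace("T", "U")


def translate(mrna):
    """mRNA to one-letter amino acid sequence, reading complete codons only."""
    dna = mrna.upper().replace("U", "T")
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


for dna in ("CCTGAGCCA", "ATTGATGAA"):
    mrna = transcribe(dna)
    print(dna, "->", mrna, "->", translate(mrna))
# CCTGAGCCA -> CCUGAGCCA -> PEP
# ATTGATGAA -> AUUGAUGAA -> IDE
```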


Approach:  (2)  Tandem  Mass  Spectrometry  of  Peptide  Ions 


Fragment 
Ions  Out 


Approach:  (3)  Sequencing  of  Peptide  Ions 


MS/MS  Spectra  of  Peptide  Ions 
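
As a rough illustration of how an MS/MS spectrum encodes a peptide sequence, the sketch below computes theoretical fragment ion m/z values for a peptide. It is not taken from the slides: it assumes singly charged b and y ions, monoisotopic masses, and no modifications, and the toy peptide simply reuses the letters from the translation example.

```python
# Monoisotopic residue masses (Da) of the 20 standard amino acids.
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
PROTON = 1.007276   # mass of a proton (Da)
WATER = 18.010565   # monoisotopic mass of H2O (Da)


def precursor_mh(peptide):
    """m/z of the singly protonated intact peptide, [M+H]+."""
    return sum(RESIDUE_MASS[aa] for aa in peptide) + WATER + PROTON


def fragment_ions(peptide):
    """Return (b_ions, y_ions) as lists of singly charged m/z values.

    b_ions[k] holds the first k+1 residues (N-terminal fragment);
    y_ions[k] holds the last k+1 residues (C-terminal fragment).
    """
    masses = [RESIDUE_MASS[aa] for aa in peptide]
    n = len(masses)
    b_ions = [sum(masses[:i]) + PROTON for i in range(1, n)]
    y_ions = [sum(masses[-i:]) + WATER + PROTON for i in range(1, n)]
    return b_ions, y_ions


b_ions, y_ions = fragment_ions("PEPIDE")  # toy peptide, for illustration only
for k, (b, y) in enumerate(zip(b_ions, y_ions), start=1):
    print(f"b{k} = {b:.4f}   y{k} = {y:.4f}")
```

Differences between consecutive b ions (or consecutive y ions) equal residue masses, which is what lets a sequence be read off a spectrum. Leucine and isoleucine have identical masses and cannot be told apart this way.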


Unknown  Bacterium 


Approach:  (4)  Matching  of  Identified  Tryptic  Peptides  to 
Theoretical  Peptides  of  Database  Bacterial  Proteomes 
Sequences
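
The matching step can be sketched as an in-silico tryptic digest of each database proteome followed by exact string matching of the identified peptides. This is a minimal sketch, not the authors' software: it assumes the usual trypsin rule (cleave after K or R unless the next residue is P), no missed cleavages, and made-up toy proteomes named `reference_A` and `reference_B`.

```python
def tryptic_digest(protein):
    """Split a protein after K or R, except when the next residue is P."""
    peptides, start = [], 0
    for i, aa in enumerate(protein):
        next_aa = protein[i + 1] if i + 1 < len(protein) else ""
        if aa in "KR" and next_aa != "P":
            peptides.append(protein[start:i + 1])
            start = i + 1
    if start < len(protein):
        peptides.append(protein[start:])  # C-terminal peptide
    return peptides


def theoretical_peptides(proteome):
    """Set of all tryptic peptides from a list of protein sequences."""
    return {pep for protein in proteome for pep in tryptic_digest(protein)}


def assignment_matrix(observed, proteomes):
    """Map each observed peptide to {proteome name: 1 if present, else 0}."""
    digests = {name: theoretical_peptides(p) for name, p in proteomes.items()}
    return {
        pep: {name: int(pep in peps) for name, peps in digests.items()}
        for pep in observed
    }


# Hypothetical toy data, for illustration only.
proteomes = {
    "reference_A": ["MKTAYIAKQR", "GGSLKPEPIDER"],
    "reference_B": ["MKTAYLAKQR", "PEPIDER"],
}
observed = ["TAYIAK", "PEPIDER", "QR"]

matrix = assignment_matrix(observed, proteomes)
for peptide, row in matrix.items():
    print(peptide, row)
```

Peptides found in only one proteome (here `TAYIAK` and `PEPIDER`) are the discriminating ones; peptides shared by all references (here `QR`) carry no identifying information.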


Classification  and  Identification  of  Unknown  Bacterium 
Cluster Analysis


Screen  Capture  Image  Displaying  Raw  LC-MS/MS  Data 
Analysis of Bacterial Mixture
Principal Component Analysis
of Peptide Assignments


Representation of the database Bacillaceae species and the unknown organism
in principal component space (PC 1, PC 2, PC 3), which reflects 80% of the total
information contained in the matrix assigning 125 peptide sequences
to bacterial proteomes.
Hierarchical clustering of Bacillaceae species in the 125-dimensional space of peptide sequences.

(Complete  linkage;  squared  Euclidean  distances) 
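
A minimal sketch of complete-linkage clustering on squared Euclidean distances is given below. It is a plain-Python illustration rather than the software used for the slides, and the six-peptide binary profiles and the names `reference_A` to `reference_C` are invented toy data; the real analysis used 125 peptides.

```python
from itertools import combinations


def sq_euclidean(u, v):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((a - b) ** 2 for a, b in zip(u, v))


def complete_linkage(profiles):
    """Agglomerative clustering; returns merges as (cluster, cluster, distance).

    Under complete linkage the distance between two clusters is the largest
    pairwise distance between their members.
    """
    def dist(c1, c2):
        return max(sq_euclidean(profiles[a], profiles[b]) for a in c1 for b in c2)

    clusters = [(name,) for name in profiles]
    merges = []
    while len(clusters) > 1:
        c1, c2 = min(combinations(clusters, 2), key=lambda pair: dist(*pair))
        merges.append((c1, c2, dist(c1, c2)))
        clusters = [c for c in clusters if c not in (c1, c2)] + [c1 + c2]
    return merges


# Hypothetical peptide-assignment profiles (1 = peptide matches the proteome).
profiles = {
    "unknown":     [1, 1, 1, 0, 0, 1],
    "reference_A": [1, 1, 1, 0, 0, 0],
    "reference_B": [1, 1, 0, 1, 0, 0],
    "reference_C": [0, 0, 0, 0, 1, 1],
}

merges = complete_linkage(profiles)
for left, right, distance in merges:
    print(left, "+", right, "at distance", distance)
```

In this toy example the unknown sample merges first with `reference_A`, which is how the nearest reference organism would be read off the dendrogram.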
Unknown sample: correctly identified as B. cereus ATCC 14579
Selected Peptide Sequences Discriminating
Protein

60 kDa chaperonin GroEL
Hypothetical  protein  BC0479 
Putative  transcriptional  regulator 
Spore  coat-associated  protein  N 

Hypothetical  protein  BA  3347  [Bacillus  anthracis  Ames] 
Location on the chromosome of Bacillus cereus ATCC 14579 (Mbp)


CONCLUSIONS 


The results demonstrate that a mass spectrometry-based
proteomics approach allows for:


High-confidence classification and identification
of bacteria based on genome-traceable proteomic similarities
and differences between an analyzed microorganism and
reference bacteria;


Identification  of  pure  cultures  as  well  as  mixtures  of 
microorganisms. 


